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O ' Abstract 

This paper describes an approach for half-duplex cooperative transmission in a classical three-node 
relay channel. Assuming availability of channel state information at nodes, the approach makes use of this 

x 

5_i ' information to optimize distinct flows through the direct link from the source to the destination and the 



path via the relay, respectively. It is shown that such a design can effectively harness diversity advantage of 
the relay channel in both high-rate and low-rate scenarios. When the rate requirement is low, the proposed 
design gives a second-order outage diversity performance approaching that of full-duplex relaying. When 
the rate requirement becomes asymptotically large, the design still gives a close-to-second-order outage 
diversity performance. The design also achieves the best diversity-multiplexing tradeoff possible for the 
relay channel. With optimal long-term power control over the fading relay channel, the proposed design 
achieves a delay-limited rate performance that is only 3.0dB (5.4dB) worse than the capacity performance 
of the additive white Gaussian channel in low- (high-) rate scenarios. 
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I. Introduction 

It is well known that the performance of a wireless network can be significantly improved by 
cooperative transmission among nodes in the network. Many cooperative transmission designs aim to 
exploit cooperative diversity that is inherently present in the network. Such designs have been suggested 
in [1], [2] for cellular networks. Recently there has been much interest in achieving cooperative diversity 
in a classical three-node relay channel [3], which represents the simplest wireless network that can derive 
advantages from cooperative transmission. 

The relay channel has been thoroughly studied in [3]. Bounds on the capacity have been given for the 
general relay channel, and the capacity has been calculated for the special case of degraded relay channels. 
The coding techniques suggested in [3] assume that the relay can operate in a full-duplex manner; i.e., 
it can transmit and receive at the same time. It is commonly argued that full-duplex operation is not 
practical for most existing wireless transceivers. Thus the restriction of half-duplex operation at the relay 
is usually considered in cooperative transmission designs. 

Since the relay cannot transmit and receive simultaneously, a time-division approach is employed in 
half-duplex relaying [4]. The source first transmits to the destination, and the relay listens and "captures" 
[5] the transmission from the source at the same time. Then the relay aids the transmission by sending 
processed source information to the destination. Note that the source may still send data to the destination 
when the relay transmits. Several techniques to process and forward the received data by the relay have 
been suggested. These techniques include the decode-and-forward (DF) and amplify-and-forward (AF) 
approaches [4]. In the DF approach, the relay decodes the received signal from the source and then 
forwards a re-encoded signal to the destination. In the AF approach, the relay simply amplifies and 
forwards the signal received from the source to the destination. 

The performance of the DF approach is limited by the capability of the relay to correctly decode the 
signal received from the source. This in turn depends on the quality of the link from the source to the 
relay. On the other hand, the AF approach performs poorly in low signal-to-noise ratio (SNR) situations 
in which the relay forwards mainly noise to the destination. In addition, the time-division approach leads 
to rate losses that are significant when the relay channel is to support high rates. Some enhanced versions 
of the AF and DF approaches have been proposed to solve the rate loss problem. A distributed space- 
time-coding protocol is developed in [6]. An incremental AF technique which requires feedback from 
the destination to the source is developed in [4]. The non-orthogonal AF and dynamic DF techniques 
suggested in [7] allow the duration of the relay listening to the transmission from the source to adapt 
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to the condition of the link from the source to the relay. In particular, the dynamic DF technique is 
shown to be superior to all the cooperative diversity techniques (except perhaps the incremental relaying 
techniques) mentioned above. A bursty AF technique is also suggested in [8] to solve the noise forwarding 
problem of the AF approach when the SNR is low. It is shown that the bursty AF technique achieves 
the best outage performance at the asymptotically low SNR regime. We note that all these cooperative 
diversity techniques mentioned so far are designed with the constraint that channel state information is 
not available at the source and the relay. Some practical code designs for the DF and space-time-coding 
approaches have been suggested in [9] and [10], respectively. 

When the links in the relay channel suffer from slow fading, it is conceivable that the channel state 
information (or at least the channel quality information) can be estimated and passed to the nodes. The 
source and relay may then use this information to optimize the cooperative protocol to achieve better 
performance. Such a design has been considered in [5], in which optimal power control is performed at 
the source and relay in order to maximize the ergodic rates achieved by the DF and compress-forward 
approaches. 

In this paper, we assume that the channel state information is available, and we develop time-division 
cooperative diversity designs that perform well in both high-rate and low-rate scenarios. The main 
distinguishing feature of the proposed approach, compared with the cooperative designs mentioned above, 
is that we do not employ the approach of the relay "capturing" the transmission from the source to the 
destination. Instead, we divide the information to be sent to the destination into three flows. The source 
employs cooperative broadcasting [11], [12] to intentionally send two distinct flows of data to the relay 
and destination, respectively, in the first time slot. The relay helps to forward, in the DF manner, the data 
that it receives to the destination in the second time slot, during which the source concurrently sends the 
remaining flow of data to the destination. The transmit powers of the source and relay as well as the 
durations of the time slots are optimized according to the link conditions and the rate requirement. This 
constitutes a form of optimal flow control. 

Due to the DF nature of the proposed design, there is an implicit restriction on the decoding delay. 
Thus we will employ the capacity-versus-outage framework [13], [14] to evaluate the performance of the 
proposed design. We will show that the proposed design can efficiently achieve cooperative diversity in 
both high-rate and low-rate scenarios. In particular, when the rate requirement is asymptotically small, 
the outage performance of the proposed design approaches that of full-duplex relaying with DF, giving 
a second-order diversity performance. On the other hand, when the rate requirement is asymptotically 
large, the proposed approach still gives a close-to-second-order diversity performance. Moreover, the 
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design also gives the best diversity-multiplexing tradeoff [15] possible for the relay channel. Together 
with the application of optimal long-term power control [16], the design can give very good delay-limited 
rate performance again in both low-rate and high-rate scenarios. 

We note that the two basic building blocks for the proposed approach are cooperative broadcasting 
(CB) in the first time slot and multiple access (MA) in the second time slot. The combination of CB and 
MA allows distinct flows of data be sent through the relay and through the direct link from the source to 
the destination, and hence can be viewed as a generalized form of routing. A practical advantage of the 
proposed design is that the basic building blocks are the well known CB and MA approaches. Practical 
MA coding designs have been well studied, e.g. see [17], [18], while practical CB coding designs are 
currently available [19]— [21]. 

II. Relay Channel: Full-Duplex Bounds 

Consider a classical three-node relay network, which consists of a source node 1, a relay node 2, and 
a destination node 3 as shown in Fig. [1] We assume that each link in the figure is a bandpass Gaussian 
channel with bandwidth W and one-sided noise spectral density Nq. Let Zij denote the power gain of the 
link from node i to node j. The link power gains are assumed to be independent and identically distributed 
(i.i.d.) exponential random variables with unit mean. This corresponds to the case of independent Rayleigh 
fading channels with unit average power gains. The results in the sequel can be easily generalized to 
include the case of non-uniform average power gains. 

In this section, we consider the case in which the relay node is capable of supporting full-duplex 
operation. Our goal is to support an information ratdj of K nats/s/Hz from the source to the destination. 
We assume that the link power gains change slowly so that they can be estimated, and hence the power 
gain information is available at all nodes. The source and relay nodes can make use of this information 
to control their respective transmit power and time so that the total transmit energy is minimized. For 
convenience, we consider a slotted communication system with unit-duration time slots. Let P t be the total 
transmit energy of the source and relay needed to support the transmission of KW nats of information 
from the source to the destination in a time slot. Since the duration of a time slot is one, P t is also 
the total average transmit power required. Although interpreting P t as the total average power may not 
carry any significant physical meaning, it is customary to speak of "power" rather than "energy" in for 



'Strictly speaking, the word "rate" here should be replaced by "spectral efficiency", since the unit involved is nats/s/Hz. 
Nevertheless we will use the terminology "rate" throughout this paper for convenience. 
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communication engineers. Unless otherwise stated, we will hereafter consider a normalized version of 
Pt, namely the rate-normalized overall signal-to-noise ratio (RNSNR) of the network: 

,A_^ l_ 

N W e K - 1 ' 

The RNSNR can be interpreted as the additional SNR, in dB, needed to support the required rate of K 
nats/s/Hz, in excess of the SNR required to support the same rate in a simple Gaussian channel with unit 
gain. This normalization is convenient as we will consider asymptotic cases when K approaches zero 
and infinity. 

We note that the use of the RNSNR to characterize our results has two important implications. First, 
since the total transmit energy of the source and relay is used in defining the RNSNR, no individual 
power limits are put on the source and relay. The results in this paper can be viewed as bounds if 
additional individual power limits are imposed. Our choice of focusing on the total energy comes from 
the viewpoint that the relay channel considered forms a small component of a larger wireless network. In 
this sense, it is fairer to compare the total transmit energy incurred in sending information from the source 
to the destination by employing cooperative diversity to that incurred in direct transmission. Second, the 
normalization by the factor e K — 1 implies that the additional SNR in dB to combat fading can only be 
a constant over the SNR required to achieve the target rate in a Gaussian channel, regardless of the rate 
requirement. That is, we restrict the SNR to increase at the same rate as in a Gaussian channel to cope 
with increases in the transmission rate through the relay channel. In a sense, this restriction enforces 
efficiency of energy usage. 

Employing well known capacity bounds on the relay channel [5], [3], [22], we can obtain the following 
bounds on the RNSNR to support required spectral efficiency of -fTnats/s/Hz. 

Theorem 2.1: For any fixed positive link power gains Z\2, Z\%, and Z23, define the bound 

Z\2 + -^23 



Zyi{Z\3 + Z23) 

— — otherwise, 
^ Zn 



and 

A Z\2 + Z\3 + Z23 



lb 



(Z\2 + Zi3)(Zi3 + Z23) 

Then S > i?DF is a sufficient condition in order to support the rate of Knats/s/Hz from the source to 
destination. Also S > B^ is a necessary condition in order to support the rate of iTnats/s/Hz from the 
source to destination. 

Proof: See Appendix lAl ■ 
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The RNSNR -Bdf is achieved by the DF approach employing the block Markov coding suggested in [3], 
[22]. We also need to optimally allocate transmit energy between the source and relay nodes. In addition, 
the availability of channel state information (both magnitudes and phases of the fading coefficients of 
all three links) as well as symbol timing and carrier phase synchronization at all the three nodes are 
implicitly assumed. The lower bound is based on the max-fiow-min-cut bound in [22]. No known 
coding technique can achieve this bound. 



In this section, we will consider the more practical scenario in which the relay node operates in the 
following half-duplex fashion. We partition each unit time slot into two sub-slots with respective durations 
t± and t2, where t\ + t2 = 1. In the first time slot, the source transmits while the relay and destination 
receive. In the second time slot, the source and relay transmit, and the destination receives. Based on this 
half-duplex mode of operation, we will describe two cooperative communication protocols that make use 
of the two basic components of cooperative broadcasting (CB) from the source to the relay in the first 
time slot and multiple access (MA) from the source and relay in the second time slot. The first protocol 
does not require phase synchronization among the three nodes, while the second protocol does so. 

A. Half-Duplex Protocol 1 (HDP1) 

In this protocol, the information from the source to the destination is divided into three flows of data 
X\, X2, and X3, where x\ +X2 + X3 = K. In the first time slot, the source sends, via CB, two flows of rates 
x\/t\ and X2A1 to the destination and relay, respectively. In the second time slot, the relay and source 
send, via MA, two flows of rates X2 / £2 and X3 / £2 to the destination, respectively. The information flow 
of rate X2/*2 sent by the relay in the second time slot is from the flow of rate X2A1 that it receives and 
decodes in the first time slot. We choose t\, £2, x\, X2, and X3 to minimize the total power transmitted 
by the source and relay to support the rate i^nats/s/Hz from the source to the destination. 

To determine the minimum RNSNR that can support the required rate when this protocol is employed, 
we start with the following lemma. 

Lemma 3.1: 1) For < t\ < 1, the infimum of the SNR required so that the source can broadcast 
at rates x\/t\ and X2A1 to the destination and relay, respectively, in the first time slot is 



III. Half-Duplex Protocols based on Flow Control 



( 1 



(e X2/tl - 1) + -Le^/^e^Ai - 1) if Z 13 > Z 
Z13 

(gxi/ti -1)+ e x,/t^ e x 2 /u _ ^ otherwise. 



Scb = { Z { 2 



12, 
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For ii = 0, Scb = 0. 

2) For < t 2 < 1, the inflmum of the SNR required so that the source and relay can simultaneously 
transmit at rates xs/t 2 and x 2 /t 2 , respectively, to the destination in the second time slot is 

' J_(e* a /fa _ 1) + J-fto/t,^*/* _ 1) if Zl3 > Z 23 , 

Sma = < Z f t 13 

— {e X3/t2 - 1) + - 1) otherwise. 

V ^13 -^23 

For t 2 = 0, S MA = 0. 

Proof: See Appendix iBl ■ 
With the help of Lemma 13.11 we can now formulate the optimization of the parameters in Protocol 1 
as follows: 

min tiScB + t 2 S M A 
subject to i. total data requirement: x\ + x 2 + X3 = K 

ii. total time requirement: t\ + 1 2 = 1 

iii. non-negativity requirements: x\, x 2 ,x^, t\, t 2 > 

where Scb and Sma are of the forms in Lemma [3TT1 It is not hard to see that (Q]) is a convex optimization 
problem and its solution provides the tightest lower bound for the SNR required to support the rate of 
A'nats/s/Hz: 

Theorem 3.1: Let B\(K) be the minimum value achieved in the optimization problem £T|), normalized 
by the factor e K — 1. Then B\{K) is the infimum of the RNSNR required so that the rate of iTnats/s/Hz 
can be supported from the source to the destination by HDP1. 

1 ) Description of B\{K): To describe the form of the RNSNR bound B\{K), we need to consider the 
following few cases. This solution is established by applying the Karush-Kuhn-Tucker (KKT) condition 
[23] to the convex optimization problem (Q]) as detailed in Appendix For notational convenience, we 
write 

n ~ x ~ y 

as the harmonic mearo of two real numbers x and y. 
a) Z13 > Mh(Z± 2 , Z 2 z): The solution is given by 

x\ = Kti, 
x 2 = 0, 
23 = Kt 2 , 



2 The definition here actually gives one half of the harmonic mean usually defined in the literature. For convenience, we will 
slightly abuse the common terminology and call Mn(x,y) the harmonic mean. 
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^23 



where ti and £2 can be arbitrarily chosen as long as they satisfy the non-negativity and total-time 
requirements. This corresponds to directly transmitting all data through the link from the source to 
destination, without utilizing the relay. The resulting value of B\(K) is 

Bi(K) = -L 

^13 

b) Zi 3 < M H {Z 12 ,Z 23 ): Define 

At = Z 23 
A 2 = Z\ 2 

Notice that A\ > 1 and A2 > 1. Consider two sub-cases: 
i. K > M H {logA 1 ,logA 2 ): 
In this case, 

Bi{K) = , (2) 

where the three SNR terms S\(K), S 2 (K), and S 3 (K) are respectively defined in ©, ©, and © 
below. 

The first SNR term is given by 

Si(K)= min (— e ^+(i-*OiogA 2+ 1 ^logA, _J_\ (3 ) 

— {O,!-^}^^!^,!} 1^12 ^23 ^13 J 

Define 



. Z 23 logA 2 s . 

!og j- +log^ 2 

\zl 2 logAl 



log Ai + log A 2 

Employing the well known inequalities log x < x — 1 for x > 1 and log x > 1 — - for x > 0, 
it can be shown that < t* < 1. By simple calculus, t* is the minimizing t\ in (f3]) above when 
max|o, 1 — i Q g Ai I < t* < min | lo ^4 2 , l|. When t* lies outside that range, the minimizing t\ 
must be one of the boundary points. When S\(K) is the minimum among the three terms inside 
the min operator in (0, the corresponding solution to the optimization problem (OQ) is given by 

Xl = Kt* -t*)\ogA u 

x 2 =t*(l-t*)log(A 1 A 2 ), 

x 3 = K(l-t*)-t*(l-t*)logA 2 , 

with ti = t* and t 2 = 1-1*. 



s 



The second SNR term is given by 



mm < 



S 2 (10 = min ^_i_ e */ti+ e A+tuog Al _ i (4) 

J-Likt^l 1^12 ^23 ^13 Z 23 

\ log A 2 ' J — ~ 



Write the minimizing value of ii in the expression above as t**. When §2(K) is the minimum 
among the three terms inside the min operator in (O, the corresponding solution to the optimization 
problem (Q} is given by 

Xl = Kt** -t**{l- h) log A 1 , 

X2 = K(l-t**)+ t**(l - t**) log Ax, 

x 3 = 0, 

with ii = t** and t 2 = l-t**. 
The third SNR term is given by 

S 3 (K)= min \^Il e K/(i-t 1 ) + 1 e ^+(i- tl )iogA 2 _l^l -A). (5 ) 

Write the minimizing value of t\ in the expression above as t***. When S3 (K) is the minimum 
among the three terms inside the min operator in (O, the corresponding solution to the optimization 
problem (Q} is given by 

xi = 0, 

x 2 = Kt*** +t***(l -t***)\ogA 2 , 

x 3 = K(l-t***) -t***(l-t***)logA 2 , 

with ti = t*** and t 2 = 1 - t*** . 
ii. K < M H QogA 1 AogA 2 ): 
In this case, 



B min{5,(/0 g,(lQ.S,(*)} 

e K — 1 



where the three SNR terms S\(K), S 2 (K), and S 3 (K) are respectively defined in ©, dHJ), and © 
below. 

The first SNR term is given by 



Si (#) = min 



+ i- tl 



•^23 



(7) 



log A2 logAi v 

Write the minimizing value of t\ in the expression above as i*. When <§i(if) is the minimum 
among the three terms inside the min operator in (0, the corresponding solution to the optimization 
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problem £T|) is given by 



x\ = 0, 
x 2 = K, 
x 3 = 0, 

with ti = i* and f 2 = 1 — £*. 
The second SNR term is given by 

5 2 (K)= min /iL e */* 1+ 1 K +tl log^_ *L_i^*i j_ (8) 

l-j^q-<ii<l 1-^12 ^23 ^13 -^23 J 

Write the minimizing value of t\ in the expression above as i**. When S^K) is the minimum 
among the three terms inside the min operator in (O, the corresponding solution to the optimization 
problem £0) is given by 

x\ = Kt^ — t**(l — log Ai 3 

x 2 = K(l - t**) + t**(l - i**) log(Ai), 

x 3 = 0, 

with ti = i** and t 2 = 1 — i**. 
The third SNR term is given by 



S 3 (*) = min K {!^e*/<i-tO + 1 

0<*!<;^ I ^23 



e K+(l-h ) log A 2 _ 1 ~*1 _ _*L \ (9) 
^12 #13 #12 J 

Write the minimizing value of t\ in the expression above as t^. When S 3 (K) is the minimum 
among the three terms inside the min operator in the corresponding solution to the optimization 
problem £T|) is given by 

xi = 0, 

X 2 — Kty:** -f" ^^^(l ) lOg A 2 , 

x 3 = K(l — i***) — i*** (1 — i***) log A 2 , 
with ti = i*** and t 2 = l — t***. 
2) Asymptotic-rate scenarios: We are interested in characterizing the required RNSNR in the 
asymptotic scenarios as the required rate K approaches zero and infinity, respectively. The following 
corollary of Theorem 13. II and the description of B\(K) above provides such characterization: 

Corollary 3.1: 1) B\(K) is continuous and non-decreasing in K for all K > 0. 

+ ^- if Z l3 < M H (Z l2 , Z 23 ) 
2) lim B^K) = { Z f Zl2 

M ^- if Z 13 >M H (Z 12 ,Z 23 ). 

V ^13 
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1-t* 



3) lim Bi(K) = { z p 



Zl2 



if Zi 3 < M H (Z 12 ,Z 23 ) 



if Z 13 >M H (Z 12 ,Z 23 ). 



4) -Bi(iC) is continuous (except at Z\ 3 = Z\ 2 = Z 23 = 0) and non-increasing in each of Z\ 3 , Z\ 2 
and Z 23 for all Z n , Z u , Z 23 > 0. 



From the solution of the optimization problem described in Section IIII-A. 1 1 (see the form of solution 
under ©), we observe that for a sufficiently low rate requirement, the most energy-efficient transmission 
strategy is to select between the direct link from the source to the destination and the relay path from 
the source to the relay and then to the destination. The choice of which path to take is determined by 
comparing the power gains of the two paths. We note that the power gain of the relay path is specified 
by the harmonic mean of the power gains of the links from the source to the relay and from the relay to 
the destination. The form of lim^^o B\ (K) in part 2) of Corollary 13.11 also suggests this strategy. 

When the rate requirement is sufficiently high, the optimal strategy (see the form of solution under 
©) is again to compare the path gains of the direct and relay paths. If the direct path is stronger, all 
information is still sent through this path. Different from the low-rate case, if the relay path is stronger, 
most of the information is still sent through the direct path. Only a fixed amount (depends on the link 
power gains, but not on the rate regardless of how high it is) of information is sent through the relay path. 
The reduction of this fixed amount of data through the direct path has the equivalent effect of improving 
the fading margin of the direct path and hence provides diversity advantage. Unlike the low-rate case, 
this strategy is not readily revealed by the form of rim#r_ >00 B\(K) in part 3) of Corollary 13.11 

B. Half-Duplex Protocol 2 (HDP2) 

In this protocol, the information from the source to the destination is again divided into three flows of 
data x\, x 2 , and x 3 , where x\ + x 2 + x 3 = K. In the first time slot, the source sends, via CB, two flows 
of rates X\jt\ and x 2 /t\ to the destination and relay, respectively, as before. In the second time slot, the 
relay sends the information that it receives in the first time slot to the destination with a flow of rates 
x 2 /t 2 . The source, on the other hand, simultaneously sends two flows of information to the destination in 
the second time slot. The first flow is the exact same flow of rate x 2 / 1 2 sent by the relay. The other flow 
has rate x 3 /t 2 containing new information. Like before, we choose t\, t 2 , x\, x 2 , and x 3 to minimize 
the total power transmitted by the source and relay to support the rate iTnats/s/Hz from the source to 
the destination. 

To send the same flow of data (with rate x 2 /t 2 ) in the second time slot, the source and relay use 



Proof: See Appendix |Pl 
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the same codebook. The codeword symbols from the source and relay are sent in such a way that the 
corresponding received symbols arrive at the destination in phase and hence add up coherently. In order to 
do so, the source and relay need to be phase synchronized and to have perfect channel state information 
of the links. We note that these two assumptions are also needed in the full-duplex approach described 
in Section ITT] In addition, the codebooks used by the source to send the two different flows in the second 
time slot are independently selected so that the transmit power of the source is the sum of the power of 
the two codewords sent. 

Since the transmission procedure is the same as that of HDP1 in the first time slot, Lemma [3TT1 part 
1) gives the minimum SNR that can support the required CB transmission in the first time slot. The 
minimum SNR required in the second time slot is given by the following lemma: 

Lemma 3.2: For < t<i < 1, suppose that the source transmits a flow of data at rate xs/t2 to the 
destination in the second time slot. Then the infimum of the SNR required so that the source and relay 
can jointly send another in-phase flow of data at rate X2 / £2 to the destination in the second time slot is 

Sua = ^(e X3/t2 - 1) + _ \ _ e a */ fa (e a »/ fa - 1). 

^13 z 13 + z 23 

For t 2 = 0, Sma = 0. 

Proof: See Appendix [EJ ■ 
Let us define Z23 = Z13 + Z23. Then we note that the expression of Sma above can be obtained by 
putting Z23 in place of Z23 in the expression of Sma in Lemma 13.11 This means that as far as minimum 
SNR is concerned, HDP2 is equivalent to HDP1 with the power gain of the link from the relay to the 
destination specified by Z23 instead. Using this equivalence, we obtain the following counterparts of 
Theorem [3J] and Corollary [3J] for HDP2: 

Theorem 3.2: Let B2(K) be obtained by replacing Z23 with Z23 in the description of B\(K) given 
in Section |IlI-A.l| Then B2{K) is the infimum of the RNSNR required so that the rate of A'nats/s/Hz 
can be supported from the source to the destination by HDP2. 

We note that B2(K) < B\{K) since HDP1 can be seen as an unoptimized version of HDP2 with zero 
power assigned to the transmission of the flow of rate X2/*2 from the source to the destination during 
the second time slot. 

Corollary 3.2: 1) B 2 (K) is continuous and non-decreasing in K for all K > 0. 

[ + if Z n <M H {Z 12 ,Z 2 3) 

2) lim B 2 {K) = I 423 Z 12 

K ~* -j- if Z lz >M H (Z 12 ,Z23). 
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3) lim B 2 (K) 



+ hr~ if z i3 < Mh (Zi 2 , z 23 ) 

— if Z 13 >M H (Z 12 ,Z 23 ). 

^13 



4) B 2 {K) is continuous (except at Z13 = Z\ 2 = Z23 = 0) and non-increasing in each of Z13, Z12 

and Z23 for all Z n , Z u , Z 23 > 0. 
In parts 2) and 3), A\, A 2 , and t* are the same as A\, A 2 , and t*, respectively, with Z23 replaced by 

■^23- 

IV. Performance Analysis 

In this section, we evaluate the performance of HDP1 and HDP2, particularly in comparison to that 
of full-duplex relaying. As mentioned previously, we model the link power gains Z\ 3 , Z\ 2 , and Z23 as 
i.i.d. exponential random variables with unit mean. The fading process is assumed to be ergodic and 
varies slowly from time slot to time slot. Hence the minimum RNSNR needed to support a given rate, or 
equivalently the maximum achievable rate for a given RNSNR, is a random variable. Thus we need to 
consider its distribution. Moreover, the two protocols, namely HDP1 and HDP2, considered in Section ITIT1 
are based on the DF approach. The relay needs to decode in the first time sub-slot and then re-encode 
to forward to the destination in the second sub-slot. Hence the decoding delay is implicitly limited to 
onqj time slot. As a result, the maximum ergodic rate achieved with optimal power control and infinite 
decoding delay [14] does not apply here. Instead we will consider the capacity-versus-outage approach 
of [13] (see also [14]) that leads to performance measures like the outage probability [13], e-achievable 
rate [16], diversity-multiplexing tradeoff [15], and delay-limited achievable rate [16]. 

A. Outage probabilities 

Outage probability is defined as the probability of the event that the rate K cannot be supported at 
the RNSNR S. Let us denote the outage probabilities of full-duplex relaying, full-duplex relaying with 
DF, half-duplex relaying using HDP1, and half-duplex relaying using HDP2 by P^K, S), Pbf(K,S) 
Pi(K,S), and P 2 (K,S), respectively. Then by Theorems 12. U [3~T1 and 13.21 we have 

P lh (K, S) = Pr(S < Bib) < Pm(K, S) < Pv(S < B DF ) = P DF (K, S) 

Pi(K,S) = Pv(S<B l (K)) 

P 2 (K,S) = Pr(S<B 2 (K)). 



3 It is possible for the relay to store the signal for a few time slots before decoding, and then forward the decoded data to 
the destination in the next few time slots. Nevertheless the decoding delay still needs to be finite. We do not consider this time 
diversity approach here as we are primarily interested in the space diversity provided by the relay channel. 
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Using these, we can obtain the following bounds on the outage probabilities. Let f(x) and g(x) be real- 
valued functions and a be a constant. We say that the function f(x) is of order ag(x) asymptotically, 
denoted by f{x) ~ 0{ag{x)), if linx^oo f(x)/g(x) = a. Moreover, we denote the z^th-order modified 
Bessel function of the second kind by K u {x). 
Theorem 4.1: 1) For all K > 0, 



Pfd(K, S) > Pi h (K, S) > 1 - 2e~ * + e ~ ? ~ O 



1 



2) For all K > 0, 



P BF (K,S)>l-e----e-s ~Q 



1.5 

52 



3) For all K > 0, 

Px(K,S) < l-e 



+ 



2zK 1 (2z)e- 3z dz 
4 



2z^i(2z)e 



-2z 



5* 2 ; 



4 



e 



4) For all if > 0, 



Pi(K,S) > 





2 f2\ 




1 " 




> 






1 — e s 













e~ z dz ~<D 



41ogg 



Equality above is achieved when K approaches 0. 
5) For all K > 0, 

P2(K,S) < l-e~s- f S 2zK 1 (2z)e- 2z dz 



+ 



±2 

s 



2zA"i(2z)e" 



-2.Z 



6) For all K > 0, 



F 2 (ir,S) > 1-e" 



5 2 z 



e s 



e s 2 







dz ~ O 



41ogS 



1.5 

52 



Equality above is achieved when K approaches 0. 

Proof: See Appendix IB ■ 
The various bounds in this theorem are illustrated in Fig. [2] 

For comparison purpose, it is easy to verify that the outage probability for direct transmission from 
the source to destination is P^{K, S) = Pr(S < I/Z13) = 1 — e« ~ O (i). From parts 1) and 6) of the 
theorem, we see that O (i) < Pfd(K, S) < O (^f). Hence full-duplex relaying provides a second-order 
diversity outage performance as expected. In addition, when the rate requirement is small and the RNSNR 
is large, the loss in outage performance due to the restriction of half-duplex relaying is at most 0.9dB by 
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using HDP2. If phase synchronization between the source and relay is impractical, then employing HDP1 
results in an additional loss of about 0.6dB. Comparing parts 2) and 6), we see that HDP2 achieves the 
same outage performance as full-duplex relaying based on DF at asymptotically small rates. All these 
observations are readily illustrated in Fig. [2] 

When the rate requirement increases, the loss of half-duplex relaying starts to increase. In Figs. [3] and 
IU we plot the outage probabilities achieved using HDP1 and HDP2, respectively. In each of the figures, 
we include the outage probabilities when the rate requirement approaches 0, 1, 3, 6, and oo bits/s/Hz. 
For comparison, we also plot the lower bounds on outage probabilities for full-duplex relaying in parts 1) 
and 2) of Theorem 12. 11 and the outage probability for direction transmission in the figures. All the results 
corresponding to HDP1 and HDP2 in the figures are obtained using Monte Carlo calculations. From 
Fig. [3l for HDP1, we see that the loss, with respect to full-duplex relaying at the outage probability of 
10~ 4 , is at most 1.5dB at 1 bits/s/Hz. The loss increases to about 2.7dB and 4.2dB when the rate increases 
to 3 and 6 bits/s/Hz, respectively. A similar trend is observed in Fig. [4] for HDP2. At 1 bits/s/Hz, the 
loss is about 0.8dB. The loss increases to 2.2dB and 4.1dB when the rate increases to 3 and 6 bits/s/Hz, 
respectively. Moreover, at all the values of K considered for both HDP1 and HDP2, the simulation results 
seem to indicate that the outage probability is of the order of O {^) for some constant a, whose value 
is different for the different cases. 

When the rate requirement becomes asymptotically large, Theorem 14.11 parts 3) and 5) state that the 
outage probabilities for HDP1 and HDP2 are at most of order O ^ 41 g| ,g ^ _ This implies that they both 
give close-to-second-order diversity performance at asymptotically high rates. From the simulation results 
shown in Figs. [3] and [4] it appears that the outage probabilities for both HDP1 and HDP2 do in fact have 
the order of O (^§f^), where a is about 2.85. This corresponds to a performance loss of about 5dB at 
the outage probability of 10 -4 , and the bound in parts 3) and 5) is about 0.8dB loose (cf. Fig. [2]). We also 
note that HDP2 does not improve the outage performance, compared to HDP1, at asymptotically large 
rates. This is contrary to the finite rate cases in which HDP2 does provide performance advantage over 
HDP1, although the amount of advantage decreases as the rate requirement increases. In summary, HDP1 
seems to be of higher practical utility than HDP2 since the former does not require phase synchronization 
between the source and relay, while it only suffers from a performance loss of about 0.6dB. 
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B. e-achievable rates 

By using the standard sampling representation [22], the input-output relationship of the 3-node relay 
channel over a time slot can be written as 

Y n = y/Z^X? + y/Z^X$ + N n 

Yx = y/Z^X? + N?, (10) 

where Xf, XV;, Y n , Y™, N n , and are the n-element transmit symbol vector at the source, transmit 
symbol vector at the relay, receive symbol vector at the destination, receive symbol vector at the relay, 
Gaussian noise vector at the destination, and Gaussian noise vector at the relay, respectively. The 
dimension n = 2W and is assumed to be large. Conditioned on the link gain vector Z = [Z±^, Z\2, Z23], 
the channel is memoryless and described by the Gaussian conditional pdf PY*,Yf\x?,x$,z(y n i Ui\ x i-> ^2)- 
An (n, M n , e n , P n )-code [27] over a time slot is one that consists of the encoding and decoding functions 
described in [3] allowing one of M n messages to be sent from the source to destination in a time slot, and 
achieves the average (averaged over all codewords sent at the source and relay, link and noise realizations) 
error probability of e n , while the maximum (over all codewords) total transmit energy used in the time 
slot does not exceed P n . Since the link gain vector is available at all nodes, we allow the transmit powers 
of the source and relay to vary as functions of the link gains. This corresponds to the application of 
power control [16]. As a result, the power control scheme is included implicitly in the code, and P n can 
be in general a function of the link gain vector Z. In most cases, we are interested in performing power 
control to minimize e n . The rate K is (e, P t ) -achievable if there exists a sequence of (n, M n , e n , P n )- 
codes satisfying lim sup^^ e n < e, lim inf^^ ^ log M n > K, and lim sup^^ P n < Pt almost 
surely (a.s.). 

For half-duplex relaying, when the relay listens (transmits), its transmit (receive) symbols are restricted 
to zero. In the previous sections, we have assumed that the relay first listens for t\ seconds in 
a time slot and then transmits in the remaining t 2 seconds. In this case, it is more convenient 
to describe the channel by the CB and MA conditional pdfs, p yfl „ z (y* in , yj in |a:* in ) and 

p Y t 2 ™\x t2n x* 2 " z(y* 2 "1 x i 2n ' x 2 2 "0> f° r tne ^ rst an d secon d sub-slots, respectively. We will also say that 
the rate K is (e, P t ) -achievable with HDP1 (HDP2) if there exists a sequence of (n, M n , e n , P„)-codes 
with the CB and MA coding in the first and second sub-slots as described in HDP1 (HDP2), satisfying 
lim sup,,^ e n < e, lim inf „^oo \ log M n > K, and lim sup n ^ oc P n < P t a.s. 

The theorem below states that the various e-achievable rates are characterized by the corresponding 
outage probabilities defined in the previous section. 
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Theorem 4.2: 1) For all e > and < 5 < 1, if the rate K is (e, P t ) -achievable, then 
e>SP lh (K, J^k^ s). 

2) For all e > 0, the rate if is (e, P t ) -achievable if Pv F (K, S) < e. 

3) For all e > 0, the rate if is (e, P t ) -achievable with HDP1 if Pi (if, S) < e. 

4) For all e > 0, the rate if is (e, P t ) -achievable with HDP2 if P 2 (K, S) < e. 

Proof: See Appendix iGl ■ 



C. Diversity-multiplexing tradeoff 

It is also interesting to investigate the diversity-multiplexing tradeoff of [15] for HDP1 and HDP2. To 
this end, we need to follow [4] to change the parameterization of the outage probabilities from (if, S) 
to (if, S), where S is the SNR and K is the multiplexing gain (0 < K < 1) defined by 

log(l + S) 

With the parameterization (K, S), the diversity orders [15] achieved by HDP1 and HDP2 are defined as 

A,(*) = I im -'°ff *\ 

S^oo log 6 

where P^{K, S) is the average error probability of HDPi at SNR S and multiplexing gain K, for i = 1 
and 2, respectively. Then the diversity orders can be readily obtained in the following corollary of 
Theorems @J] and |4~21 

Corollary 4.1: For i = 1 and 2, Aj(JK') = 2(1 - K). Hence HDP1 and HDP2 achieve the maximum 
diversity advantage possible for the relay channel when link gain information is available at all nodes. 
Proof: First, by Theorem |4~2l P*(K,S) < Pi(K,S) for i = 1 and 2. Also notice that since 

S = S(e K -1), S = — — g-jr -. Hence 5 ~ O (s 1 -^. As a result, applying parts 3) - 6) of 

Theorem 14.11 with the parameterization (K,S), for sufficiently large S and i = 1,2, 

4(1 -if) logS \ 

where a\ = 2 and a2 = 1.5. Applying — log, dividing the result by log S, and finally taking limit as 
S —>■ oo on each item in the inequality equation above give Ai(K) > 2(1 — K) for i = 1 and 2. On 
the other hand, part 1) of Theorem 14. 1 1 and part 1) of Theorem 14.21 force the error probability of any 
transmission scheme over the relay channel to be larger than 50 \ -^rrw] ^ for all < 5 < 1. As a result, 
the maximum possible diversity order of any transmission scheme over the relay channel is 2(1 — K). 
Thus we have the desired result. ■ 



°(iii?by) £P -< A "^ ( 
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D. Delay-limited rates 

When the average error probability decreases to zero, the e-achievable rates becomes the delay-limited 
rates [16]. We calculate the delay-limited rates achievable by HDP1 and HDP2 in this section. 

We first employ the following definition [27] as our definition of delay limited rate: The rate K is Pi- 
achievable if there exists a sequence of (n, M n , e n , P n )-codes over a time slot, satisfying lim n _>oo e n = 0, 
liminfj^oo - logM n > K, and limsupn^oo P n < Pt a.s. . Unfortunately, part 1) of Theorem 14.21 forces 
the P t -achievable rate of any transmission scheme over the relay channel to be zero as long as P t is 
finite. This is due to the Rayleigh fading nature of the links and the restriction that the total transmit 
energy in each time slot needs to be bounded by P t . It turns out that more meaningful results can be 
obtained if we relax the latter restriction. 

Recall that the link power gains vary independently from time slot to time slot. With power control to 
maintain the error probability e n , the total transmit energy P n (a function of Z) of an (n, M n , e n , P n )- 
code may vary from time slot to time slot. This may require the total transmit energy to be very large 
in the worst faded time slots. As a relaxation of the transmit energy constraint, we require the average 
total transmit energy per time slot over many time slots to be bounded. Then the ergodicity of the fading 
process requires E[P n \ to be bounded. This relaxation motivates the following definition: The rate K is 
long-term Pi-achievable if there exists a sequence of (n, M n , e n , P n ) -codes over a time slot, satisfying 
lim^oo e n = 0, limimV+oo ± logM n > K, and limsup^^ E[P n ] < P t . 

The following theorem then specifies the delay-limited rates achievable by HDP1 and HDP2: 

Theorem 4.3: For K > 0, define 

Pf\K) = E[B lh ](e K -l)N W 

P? F (K) = E[B DF ](e K -1)N W 

Pt(K) = E[B x {K)]{e K -1)N W 

Ff{K) = E[B 2 (K)](e K - 1)N W. 

Then P\ h (K) < PP F (K) < P?(K) < P?(K) < oo. 

1) If the rate K is long-term p-achievable, then P t > P} b (K). 

2) The rate K is long-term Pp F (K) -achievable. 

3) The rate K is long-term P/(if) -achievable with HDP1. 

4) The rate K is long-term Pt(K) -achievable with HDP2. 

Proof: See Appendix IH1 ■ 
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In Fig. |5l we plot the rate K against j^ffi for i £ {lb, DF}. For i € {1, 2}, notice that 

£[^(0)](e* - 1) < < - 1) 

by Corollaries 13.11 and 13.21 respectively. In each case, we plot the lower and upper bounds instead. The 
true P ^Jw curve lies between the bounding curves. Also the true curve approaches the lower bound 
when K is small and the upper bound when K is large. For comparison, we also plot the curve jf^jp 
which corresponds to the SNR required to achieve the rate K in an additive white Gaussian noise 
(AWGN) channel with unit power gain. Thus, at each rate, the loss of performance, with respect to an 
AWGN channel, in dB for approach % due to link fading and the restriction of one-slot decoding delay 
is E[Bi]$B = 101og 10 (i£[-Bi]). The results obtained from numerical calculations are shown in Table J] 
From the table, we see that the loss when employing full-duplex relaying is between 2.17dB and 2.76dB, 
where the upper limit on the loss can be achieved by DF with optimal power control. The loss when using 
HDP1 with optimal power control ranges from 3.33dB to 5.45dB, while the loss when using HDP2 with 
optimal power control is between 3.02dB and 5.36dB. The loss of performance of half-duplex relaying 
with respect to full-duplex relaying is at most 3.28dB. This loss happens when the rate requirement is 
very large and HDP1 is employed. When the rate is very small, the loss drops down to at most 0.85dB 
with the use of HDP2. With the delay-limited rate as performance measure, HDP1 once again appears to 
be a good tradeoff between complexity and performance. The maximum loss when using HDP1 instead 
of HDP2 is only 0.31dB. 

V. Conclusions 

With channel state information available at all nodes, we have shown that a half-duplex cooperative 
transmission design, based on optimizing distinct flows through the direct link from the source to the 
destination and the path via the relay, can effectively harness diversity advantage of the relay channel 
in both high-rate and low-rate scenarios. Specifically, the proposed design gives outage performance 
approaching that of full-duplex relaying using decode-and-forward at asymptotically low rates. When 
the rate requirement becomes asymptotically large, the design still gives a close-to-second-order outage 
diversity performance. The design also gives the best diversity-multiplexing tradeoff possible for the relay 
channel. With optimal long-term power control over the fading relay channel, the proposed design can 
give delay-limited rate performance that is within a few dBs of the capacity performance of the additive 
white Gaussian channel in both low-rate and high-rate scenarios. 

In addition to the good performance, a perhaps more important advantage of the proposed relaying 
design is that only flow-level design is needed to optimize the use of the rather standard components 
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of cooperative broadcasting and multiple access. This advantage makes generalizations of the design to 
more-complicated relay networks manageable. In general, the availability of channel information at the 
nodes appears to simplify cooperative transmission designs. Thus it is worthwhile to investigate how to 
effectively spread the channel state information in a wireless network. 



Appendix 

A. Proof of Theorem \2.1\ 

First consider the sufficient condition in Theorem 12.11 We consider two different cases: 
1) Z\2 > Z\ 3 : Let < a < 1 be the fraction of the total energy, P t , allocated to the source node. 
Then the transmit energy of the relay node is (1 — a)Pt- By [3, Theorem 1] (also see [5]), the following 
rate is achievable by the relay channel when the relay decodes and re-encodes its received signal: 

R DF (a) = ^ax^ min jc aZ i3 + (1 - a)Z 23 + 2a/ (1 - (3)a(l - a)Z l3 Z 23 , 

where C(x) = log(l + x). We can further maximize the rate by optimally allocating transmit energy 
between the source and relay, i.e., 

i?DF = max Rr>p(a) 

0<a<l 

= C f^Af? max min \ aZ ™ + (1 - «)^23 + 2^(1 - 0)a(l ~ a)Z 13 Z 23 , af3Z l2 \] , 

\1\qW 0<a,/3<l I J 1 

where the second equality results from the fact that C{x) is an increasing function. Hence, the requirement 
that the RNSNR satisfies S > 1/Z DF is sufficient for _R DF = K, where 

Z DF = max min I aZ 13 + (1 - a)Z 23 + 2a/ (1 - /3)a(l - a)Zi 3 Z 23 , a(5Z 12 \ ■ (12) 

0<a,/3<l L J 

Thus it reduces to solving the optimization problem in (fT2~l) . 



To solve (fT2l . we write Z DF (a) = max min |aZi3 + (1 — 0)^23 + 2a/ (1 — P)a(l — a)Zi 3 Z 23 , aj3Z\ 2 
and consider two cases: 

^23 

a) < a < : Under this case, the second term inside the min operator is smaller 

Z 23 + Z\ 2 — Z\ 3 

than the first term for all < (3 < 1. Hence Zy> f (ql) = maxo</3<i af3Z\ 2 = aZ\ 2 . 



20 



^23 

b) — — < a < 1: Under this case, notice that the first and second terms inside the 

^23 + Z12 — Z13 

min operator are strictly decreasing and increasing in 0, respectively. Moreover the two terms equalize 
at some < f3* < 1. Hence Z DF (a) = a^Zyi- Solving for the equalizing we get 

,Z\3 



Z DF (a) = aZ 13 + (1 - a)Z 23 1-2 

^12 



Now we maximize Z DF (a) over < a < 1. For case a), maxZ DF (a) 




■^23 + Z\2 — Z13 

direct but tedious calculation shows that maxZ DF (a) = . It is not hard to verify that the 

a ' * Z12 + Z23 

Zj\2\Z\3 -\- Z23) 

maximum value in case b) is larger than the maximum value in case a). Hence Z DF = , 

^12 + ^23 

^12 1 ^23 

and the sufficient condition is S > -. 

Zl2\Zl3 + Z23) 

2) Z\2 < Z\3. First note that the capacity of the relay channel is upper bounded by the maximum 
sum rate of the CB channel from the source to the relay and destination. This CB channel is a degraded 
Gaussian broadcast channel, and the individual rates R13 and R12 from the source to the destination and 
relay, respectively, satisfy [22, Ch. 14] 

'aZ 13 P t ' 



R13 < C 



N W 



for any < a < 1. To have R13 + R12 > K, we need 



faZnPA ( (1 - a)Z 12 P t 
K < max < C — 77-^77- + C ' 



0<«<i [ \ N W J \aZ 12 P t + NqW / 



log 

C ( Z 13 



1 + Z12 — j ■ max 



NoWJ o<a<i yi + aZ^j^ 



'N Wj ' 

where the last equality is obtained by choosing a = 1, due to the condition that Z13 > Zu- Hence 
S > \jZ\3. This lower bound corresponds to sending all information directly form the source to the 
destination without using the relay. 

For the necessary condition, we employ the max-flow-min-cut bound of [22, Theorem 14.10.1] to obtain 
an upper bound, R\b(a), on the rate of the relay channel. It turns out [5] that the expression for i?ib(a) is 
obtained simply by replacing every occurrence of Z12 by Z12 + Z13 in (fTTb above. In addition, the power 
optimization procedure in case 1) above carries through directly for this case with every occurrence of 

Z\2 ~t~ Z\3 ~\~ Z23 

Z\2 replaced by Z\2 + Z\3. Thus we obtain the necessary condition as S > — -. 

{Z12 + Z\3)[Z\3 + Z23) 
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B. Proof of Lemma I3.il 

1) The case of ti = trivially requires X\ = %2 = 0, and hence Sqb = 0- So we consider < t\ < 1. If 
Z13 > Z\2, we have a degraded broadcast channel during this time slot. Thus rate constraints in ([T3T ) 
must be satisfied with R\% = x\jt\ and R\2 = x^/ti. Combining the two inequalities to remove a, 
it is easy to obtain the stated lower bound Scb of the SNR P t /NoW '. We note that this lower bound 

corresponds to the optimal choice a = n rr — ^-77 • Interchanging 

F F (e X2/tl - 1)/Z 12 + e X2/tl (e Xl/tl - 1)/Z 13 5 5 

the roles of Z13 and Z12, we get the stated SNR lower bound for the case of Z13 < Zi 2 . 

2) The case of i 2 = trivially requires x 2 = X3 = 0, and hence ^ma = 0. So we consider < i 2 < 1. 
The capacity region of this Gaussian MA channel is specified by [22, Ch. 14]: 

%3 c ( aZ 13 P t 



t 2 V N oW 

X2 c { (I - a)Z 23 Pt\ 



t 2 V N W )> 

X2 + X3 f [aZ n + (1 - a)Z 23 ]PA 

for any < a < 1, where a and 1 — a are the fractions of the transmit power assigned to the 
source and relay, respectively. We want to optimally choose a so that the SNR P t /NoW required 
to satisfy ([141 is minimized. First, suppose that Z13 > Z 23 . Then rearranging the second and third 
inequalities in (O gives 



a < 1 - — e X2/Z2 - 1 



1 r^/t, A N W 



1 

a > 



Z23 ^ ' Pt 

(x 2 +x 3 )/t 2 _ ^ 



Zl3 — Z23 



N W 

~^--Z23 



respectively. Combining these two inequalities to remove a, we obtain the stated lower bound Sma- 

We note that the corresponding optimal choice a = n tt — ^77 ■ 

P B P (e^/* 2 - 1)/Z 23 + e x '^{e X3 ^ - l)/Z 13 

Interchanging the roles of Z13 and Z23, we get the stated SNR lower bound for the case of 

Z13 < Z23. When Z\% = Z23, the third inequality in (fT4l gives the stated lower bound Sma- The 

choice of optimal a in this case is exactly the same as the one in the case of Z13 > Z23 (or 

Z13 < Z23). 



C. Solution to optimization problem ([7]) 

Suppose that t\ and i 2 are fixed, satisfying both the non-negativity and total-time requirements. Then 
we can view the optimization problem £[]) as a convex optimization problem in x\, X2 and X3. Rewriting 
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it in standard form [23]: 



min S(*i,i 2 ) = hS C B + *2<Sma 

Xx,X 2 ,X 3 



Subject to i. total-data requirement: x\ + X2 + x 3 = K 

(15) 

ii. non-negativity requirements: — xi, — X2, —x 3 < 
Since this optimization problem is convex, the rate tuple (21,22,^3) is a solution if it satisfies the 

following KKT conditions: 

3 

Kl. VS + AiV(-»i) + //V(xi + x 2 + x 3 - K) = 
K2. Ai(-x/)~= for i = 1, 2, 3 
K3. ^ > for i = 1,2,3 
K4. -Xi < for i = 1,2,3 
K5. xi + X2 + X3 = if. 

Our approach to solve the original optimization problem dTJ is to first solve the sub-problem ([TBI ) for 
each pair of (i 1 , i 2 ) ? and then minimize min Xl X2 ^ X3 S(t\, £2) over all allowable pairs. To this end, we 
consider the following cases and obtain solution to the optimization problem (IT5T > by directly checking 
the KKT conditions. Note that we assume in below that both t\ and £2 are positive. For £2 = {t\ = 0), 
Lemma 13.11 tells us that the transmission in the first (second) sub-slots reduces trivially to transmission 
over the direct link from the source to the destination. Hence min XliX2]:E3 5(1, 0) = min XljX2jX3 S(0, 1) = 
(e K - 1) /Z 13 . 

1 ) Z13 > Z\2 and Z\ 3 > Z23: From Lemma 13.11 

§(t 1)t2 ) = ±L _A + *L e * a /*i L*i/h _A + Jl (>/fa _A + ^x 2 /t 2 Lx 3 /t 2 _ 1 
Z\2 V ' Z\ 3 \ ) Z23 V / Z\ 3 V 

The condition Kl yields 

_L e (* 1+ z 2 )/ fl _ Al + M = o 

Z\z 

— ^/fa + J_ e x 2 /h^ e xi/u _ -q + _L e x 2 /t 2 + _L e x 2 /t 2 ( e x 3 / t2 _ X ) _ A2 + /U = o 

Zyi ^13 -^23 Z13 

(x 2 +x 3 )/t 2 _ Xs + fl = Q _ 

Z\z 

It is then easy to check that the following solution satisfy the KKT conditions: 



J_ + _1 1_ + J_^K _ x s 

Z\2 Z23 Z13 Z\3 


K 



Xi 


= Kt h 


Ai 


x 2 


= 0, 


A 2 


x 3 


= Kt 2 , 


A3 



2) Zi3 > Z\2 and Z\3 < Z23: From Lemma [3TT1 

§(t u t2) = — (e x ^ - l) + iLe*»/* (>/** _ 1) + Jl. (>/* _ 1) + _^_e«3/fa (>/fa 
Z12 V / Z13 V / Z13 V / Z23 ^ 

The condition Kl yields 

_]_ e (x 1 +x 2 )/t 1 _ Xl+fi = Q 

f 3 1 1 

e*»/*i + ^jt x ^jt x _ y + e (x 2 +x 3 )/t 2 _ Aa + ^ = o 

Z12 Z13 Z23 

e ^/t 2 + e x 3 /t 2 ( e x 2 /t 2 _ X ) _ Aa + ^ = _ 

Z13 Z23 

It is then easy to check that the following solution satisfy the KKT conditions: 

x x = Kti, Ai = 

^2 = 0, A 2 = — - — + — e K 

Z12 Z13 Z23 

^3 = Kt 2 , A 3 = 

^13 

JJ Z" 13 < Z" 12 and Z" 13 > Z23.' From Lemma [3TTI 



Sfa, i 2 ) = e^/t! _ x + - e 81 /* 1 e 1 ^ 1 - 1 + -M e* 2 ^ - 1 + -^ e - 2 / t2 e 

Z13 V / Z12 V / Z23 V / Z13 V 

The condition Kl yields 

_L e *i/*i + J_ e *i/*i(e« a /ti _ 1) _ Al +/; = 
zp Z12 

gCsx+a&J/ti + e ^/t2 + & x 2 /t 2 ( e x 3 /t 2 _ 1) _ ^2 + /U = 

•^12 -^23 -^13 

e (^ 2 +^ 3 )/t 2 _ Ag + ^ = Q 

Z13 

It is then easy to check that the following solution satisfy the KKT conditions: 

x\ = Kt%, Ai = 

X2 = 0, A 2 = —e K + — - — 

Z12 Z23 ^13 

X3 = Kt 2 , A 3 = 

^13 

4) M H {Z l2 ,Z 2 3) < Z 13 < mm{Z 12 ,Z2 3 }: From Lemma £TJ 
S(t u t 2 ) = — (e Xl / u - l) + iLe«i/*i _ 1) + A. (>»/<> - l) + A_e*»/fa (>/*» 

Z13 V / Z"i2 V / Z*i3 V / Z23 V 

The condition Kl yields 

J_ e *i/*i + _L e ^/*i( e ^Ai _ 1) _ Ai + // = 
Z13 ^12 

e (xi+X a )/*l + e (^2+X'3)/t 2 _ A 2 + ^ = 

Zl2 -^23 



^12 ^23 

i-e*'/*" + J_ e *»/fe _ X ) _ a 3 + ^ = 0. 

•^13 -^23 
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It is then easy to check that the following solution satisfy the KKT conditions: 

x\ = Kti, Ai = 

X2 = 0, \ 2 = (-L + -L- -L)e K 

Z12 Z23 Z13 

x 3 = Kt 2 , A 3 = 

^13 

5) Z13 < Mh{Z\2, Z23): The expression for S(t\,t2) in case 4) still holds. However, we need to 
consider the following two sub-cases in order to express the solution to the optimization problem (fT5T > : 
a) K > Mh (log A\ , log A 2 ): 

i. For 1 — ; K . < t\ < , K , , the following solution satisfies the KKT conditions: 

log Ai — 1 — log A 2 ' fc 

x\ = Kt\ — t\t2 log Ai, Ai = 

x 2 = t 1 t 2 log(A 1 A 2 ), A 2 = 

x 3 = Kt 2 - ht 2 logA 2 , A3 = 

„ ^_ e K+t 2 log A 2 ^_ e K+t 1 logA 1 

Z\2 Z 2 3 

ii. For lo ^4 2 < t\ < 1, the following solution satisfies the KKT conditions: 

si = Ifii — tit 2 logAi, Ai = 

x 2 = Kt 2 + t 1 t 2 logA 1 , A 2 = 



-^23 Z\ 2 



Z\z Z 2 z Z\ 2 



iii. For < t\ < 1 — , ^ , the following solution satisfies the KKT conditions: 

xi = 0, Al = y--y--^ A2 

Z13 A 12 ^23 
X 2 = Kti +tit 2 log^ 2 , A 2 = 

x 3 = Kt 2 - ht 2 log A 2 , A 3 = 

„ l_ e K+t 2 \ogA 2 _ 1 c K/t 2 _ 

Z\2 Z23 



b) K <M H {\ogA 1 ,\ogA2): 
K <t<l _K 

\ogA 2 — 1 — logAi 



i. For 1 K A <t\<l— , ^ , the following solution satisfies the KKT conditions: 



xi = 0, A x = J- - J- - -U^ 

Z13 Z12 Z 2 3 

x a = K , A 2 = 

X3 = 0, A3 = -L - J- - 

^13 z 23 ^12 

^ = Lg^Ai _ _ e ^/fe. 

Z12 Z23 
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ii. For 1 — i Q g^ i < t\ < 1, the following solution satisfies the KKT conditions: 

x\ = Kt\ — t\t2 log Ai, Ai = 

x 2 = Kt 2 + ht 2 logA 1 , A 2 = 

1 1 1 Klf 

X3 = 0, A 3 = — - — - ^e^ 1 

^13 z 23 z 12 

^ ' i _ e K+t 1 logA 1 _ _}_ e K/t K 

iii. For < t\ < lp ^ , the following solution satisfies the KKT conditions: 

xi = 0, A 1 = J- - J- - 

Z13 Z12 Z23 

x 2 = i^ti + tit 2 log^ 2 , A 2 = 

x 3 = #i 2 - M 2 log-4 2 , A 3 = 

„ = ~}_ e K+t 2 \ogA 2 _ _}_ e K/t 2 _ 

Z\2 Z23 

For cases l)-4), direction substitution of the solution yields mm XljX2jX3 S(t\, i 2 ) = (e^ — l) /Z13. 
Since this solution is independent of the choice of (ti,t 2 ), the solution to the optimization problem (fl} 
in these four cases is simply [e K — 1) /Z 13 . Also note that these four cases can be collectively specified 
by the condition Z13 > Mh{Z\2, Z23). 

For case 5a), the three functions S\(K), §2{K), and S^{K) respectively described in (fj), ©, and © 
can be obtained by direct substitution of the solutions in the 3 sub-cases (i., ii., and iii, respectively), and 
then minimizing the corresponding tl\w\ Xi X2 X3 5"(ti,t 2 ) over the range of t\ specified in each sub-cases. 
Hence the final solution of the optimization problem dT) is obtained by finding the minimum among the 
these three functions. For case 5b), a similar procedure yields the fact that the solution to the optimization 
problem CO) is the minimum among the three functions S\(K), 5^ (if), and S3 (K) respectively described 
in 0, ©, and ©. 



D. Proof of Corollary \3.1\ 

The proof of the results in this corollary is based on the fact that B\ (K) is the (normalized) solution 
to the optimization problem £T|) and the form of B\{K) described in Section HlI-A.il 

1) Fix Z13, Z\2, and Z 2 3. From the description of B\(K) in Section llII-A.il since B±(K) is trivially 
continuous and non-decreasing in K when under the condition Z13 > Mh(Z\2, Z23), it suffices to 
consider B\(K) under the condition Z13 < Mh(Z±2, Z23), which is assumed for the rest of the 
proof. 
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For convenience, let us denote the respective functions inside the min operators of Si(K), S2{K), 
S 3 (K), St(K), S 2 (K), and S 3 (K) by the addition of the sign '. Define, for < *i < 1, 



e K -l 

e K -l 

SUK) 
e K -l 

S[(K) 
e K -l 

e K -l 
SUK) 
e K ~l 



if K > M H (logA 1 ,logA 2 ) and max jo, 1 - ^] < h < min l} 
if K > M H (logA 1 ,logA 2 ) and minj^,!} < t x < 1 
if K > M H (logA 1 ,logA 2 ) and < h < max{o,l - 
if < K < M H {\ogA 1 ,\ogA 2 ) and 



K 
log A 2 



if 0< if < M H (logA 1 ,logA 2 ) and 1 



if < K < M H (log^i,log^ 2 ) and < t x < 



K 
\ogA x 

K 
logAi 

- lT < tl < 1 

K 
logA 2 



<h<l 

K 



(16) 



Notice that for any fixed < t\ < 1, B(K,ti) is piecewise continuous, with six pieces over the 
respective ranges of K that they are defined. Also it is easy to check that at each end point where 
two adjacent pieces meet, the values of the pieces coincide (and hence the definition of B(K,t\) 
above is valid). Thus B(K,t\) is continuous in K. The same argument with fixed K shows that 
B(KAi) is continuous in t\. Then B\(K) = min B(KAi) and hence is continuous in K. 

0<ti<l 

Now to show Bi(K) is non-decreasing in K, it suffices to show that, for each fixed < t% < 1, the 
functions fjrrr, ^-i > e*— l ' ^-l ' ^-l ' an< * ^-l are a ^ non -decreasing in the corresponding 
ranges of K that the functions are used in the definition of B{K,t\) in (fl~6l) above. To this end, 
we will repeatedly employ the following form of Young's inequality: 

x t y l ~ t < tx + (1 - t)y, 

for nonnegative x,y, and < t < 1. 

Fix < ti < 1. First let us consider Si (if). For if in the corresponding range in (fl~6l) . 



But by Young's inequality, 



1 

Z13 



•^23 



.4 



l-t* 



Z12 / ( e ^ - 1) 



2 ' 



1 



Al 

Z 2 3 



A l-t* 



z 



.12 



> 



--(- 

Z\3 \Z 2 3 
1 l-t 
Z\3 

0. 



l-t* 



z- 



23 



1 1 

Z13 Z\ 2 

t* t* 

Z\ 2 



t* 



- H 

-^13 ^12 



1 

,^12 
l-t 



Z 



+ 



1 1 

-Zl3 ^23 
-t* 



l-t* 



13 



z 



23 



Thus i s non-decreasing. 



Next consider S' 2 (K). For if in the corresponding range in (fl6l) . 



d ^(if) 



A" 



1 

^12 



(1 - t)e*/* + t - e 



Jf(i-t)/t 



+ 



l-t 



■^23 



-^12 -^13 
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Again by Young's inequality, 

(1 _ t ) e K/t + t _ e K(l-t)/t > e K(l-t)/t . x t _ e K(l-t)/t 



and 



1-t- At t t 1-t t t / 1 x 1 ' 

+ 



2) When K is sufficiently small, B\{K) = — -. Then a simple application of L'Hospital's rule 



. - ■ {--- 

Z23 Z\2 Z13 Z23 Z12 Z13 \Z23 J \Zi3 Z\2 

1-t t t 1-t t t 
> j 1 

Z23 Z\2 Z\3 Z23 Z\3 Z\2 
= 0. 

Hence S £ 2 k^I is non-decreasing. Finally, we note that the non-decreasing nature of the functions 

S e 3 K^l , fx 5i » S e 2 K-i ' an( l S e 3 K-i can rje proven in the same way. 

Si(K) 

e K -r 

gives the desired result. 

S ( 

3) When K is sufficiently large, B±(K) = . Then simply taking limit gives the desired result. 

e A — 1 

4) Fix K and t\. Augment the definition of B(K,t\) in (fT6l ) by adding B{K,t\) = ■^■^jrrr when 
Z13 > Mff(Zi2, Z23). For the rest of the proof, this augmented B(K,t\) will be considered as a 
function of Z13, Z\2, and Z23, despite its notation. Then an argument similar to the one in part 1) 
can be employed to show that B\{K) is continuous in each of Z\3, Z\2, and Z23 for every K > 0, 
except for Z13 = Z\2 = Z23 = at which B\(K) becomes infinite. 

Again similar to part 1), in order to show B\{K) is non-increasing in each of Z13, Z12, and Z23, 
we only need to show that for each fixed K and t\, the functions S[(K), S' 2 (K), S' 3 (K), S[(K), 
S' 2 (K), and S' 3 (K) are all non-increasing in each of Z13, Z12, and Z23, over the respective ranges 
of these functions shown in (fT6l ). Indeed, this fact can be shown by verifying that the derivatives 
involved are all non-positive. The only interesting case is d ^ z , which needs the use of Young's 



inequality: 



^ = -4- K [t^r 1 + (1 - tiW 1 ] - 1} 

azi3 z 13 



< -^ T {e K {A l A 2 r t ^-l} 

Z 13 L J 

L | e if-t 1 (i-* 1 )(iog^ 1 +iogyi 2 ) _ A 

Z\3 *• ' 

< -J T \e K - t ^ 1 - t ^ K -l}=0, 

Z-i q I J 



J 13 

where the second line is due to Young's inequality and the last line is due to the fact that K > 
(1 — t\) log A\ and K >t\ log A2 in the range of interest of S[(K). 
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E. Proof of Lemma 13.21 

The case of t% = trivially requires X2 = x 3 = 0, and hence §ma = 0. So we consider < t 2 < 1. 
Suppose that the transmit power of the relay is P2 and the transmit power of the source is Pi + P3, where 
Pi is the power employed to transmit the flow of rate X2A2 while P3 is the power of the flow of rate 
£ 3 /£ 2 . Then the transmission procedure in the second time slot of HDP2 (cf. Section Ull-Bb describes 
the transmission over an equivalent two-user Gaussian MA channel in which one user of rate ^2/^2 has 
power (V Z\ 3 P\ + \J Z23P2) 2 and another user of rate xs/t2 has power Z\ 3 P 3 . From the capacity region 
of this Gaussian MA channel specified by [22, Ch. 14], Pi, P2 and P3 must satisfy: 

t 2 \N Wj 

X2 c ( (^PT + ^) 2 \ 

X2 + X3 ( {Jz^p x + VM) 2 + z 13 p 3 \ 

To minimize the total power (energy) given the rates of transmission, we consider the following 
optimization problem: 

minPi +P2 + P3 
subject to /i(P) = a- Z13P3 < 

/ 2 (P) = b - {y/Z^Pl + y/Z^) 2 < 

/ 3 (P) = c - WZ^P X + ^Z^P 2 ) 2 - Z 13 P 3 < 

/ 4 (P) = -Pi < 

/ 5 (P) = -P 2 < 

/ 6 (P) = -P 3 < 0, 

where P = (P 1 ,P 2 ,P 3 ), a = N W(e x ^ t2 - 1), b = N W(e X2 / t2 - 1), and c = N W(e^ X2+X3 ^ t2 - 1). 
Notice that c > a + b. It can be shown that this is a convex optimization problem. The power tuple 
(Pi,P 2 ,P3) is a solution if it satisfies the following KKT conditions: 

6 

Kl. V(Pi + P 2 + P 3 ) + Wi(P) = 
K2. Ai/j(P) = for i = 1~2, . . . , 6 
K3. Ai > for i = 1,2,... ,6 
K4. /i(P) < for i = 1,2,... ,6. 
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The condition Kl yields 



1 - (A 2 + \MZ n P l + ^Z 2Z P 2 ) x j^- - A 4 = 

^23 



Z 



1 - (A 2 + A 3 )(v% 3 Pi + v%3^)y -f- - A 5 = 
1 - (Ai + X 3 )Z 13 - A 6 = 0. 

It is then easy to check that the following solution satisfies the KKT conditions: 

= (c-a)Zi3 = _L 1 

(Z13 + Z23) 2 Z13 Z13 + Z23 

p _ {c-a)Z 2 3 A = 1 

{Z13 + Z23) 2 Z13 + Z23 

O 

P3 = -=—, A2 = A4 = A5 = A6 = 0. 

z 13 

Then normalizing the sum of this choice of Pi, P2, and P3 by NqW gives the stated expression of 5 M a 
in Lemma! 



F. Proof of Theorem I4.il 

To prove the theorem, we need to use the following result 
Claim 1: For any x > z > 0, 

1 1 1 



Proof: 



Pr ( — + — — > - ) = 1 - 2xK l (2x)e~ 2x+z 
Zu Z 23 + z x 1 



Pr / JL + _A — > l\ = f° Pr (_L > I — L_ 1 e -v dy 



Z\2 Z23 + Z xj J \Zi2 x y + z 



x—z ^00 

e~ y dy + / I 1 — e *~y+^ ) e~ y dy 







00 



x—z 



= l-e~ 2x+z J°° e~( y+ ^)dy, 

where the integral in the last line is an integral representation of the function 2xK\ (2x) [24, pp. 53] or 
[25, pp. 969]. ■ 
We note that the same result is obtained for the special case of z = in [26] using moment generating 
functions of exponential random variables. 
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1) By Theorem ED 

> Pr I ls< - 1 - 1 U is < 



1 - Pr [ i S > 



Z\2 + Z13 
1 



1 







Z12 + Z 13 
1 



-^13 + -^23 
1 



•^13 + -^23 
1 - Pi ( Z, 2 > --zj -Pr I Z 23 > 



e z dz — 
1 - 2e~^ + e 



o 



e *cfe 



e s+ z <i2 



It is also easy to see that lims'_ +00 1 2e 1 yg^ e - = 1. 



2) By Theorem EJl 



Pdf(K,S) = Pv{S<B DF ) 

1 



Pr S < 



Z 



18 



Z i3 > ^12 • Pr(^13 > Z 



12 J 



+ Pr S < 



Z\2 + Z23 



Zl2(Z\3 + Z23) 



Z 13 < Z12 • Pr(Z 13 < Z 



12 • 



A simple calculation shows that a = ^ ^1 + e s ) — e s • Conditioned on the event {Z13 < Z12}, 
< U {S < ^} C {S < z$0^ \ Hence 

1 



b > Pr 5 < 



1 



Z 



12 



vj{S< 



Z13 + Z23 



Z 13 < Z12 • Pr(Z 13 < Z 



12 



Pr(Z 13 < Z12) -Pr 5> 



Z12} 



Pr(Zi2 > z) — Pr I Z12 > max < z 



n <^ 5 > 



1 



Z13 + Z23 



n {z 13 < Z12} 



Pr I Z 23 > — - z 



e z dz 



1) e~ z dz 



- 1 — e s — — e s . 



S 



It is also easy to see that lim,s_ 



.OO 1 / S 2 



1.5. 



3) By Corollary EU Si (if) < lim^/^oo £i(if') for all K > 0. Note that when if is sufficiently 
large, B\{K) = S J< K l ■ Now instead of choosing the optimal t* in ©, we choose t\ = 1/2 and 
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normalize the suboptimal solution by the factor e K — 1. Taking limit as K — > oo, we get 



Bi = { 



1 / 1 



1 



Z23 \Z\3 Z\2 

1 



+ 



1 / 1 



12 \^13 



Q Z- 



23 



if Z13 <M ff (Z 12 ,Z 23 ) 

if Z13 >M H (Zi2,Z 23 ). 



Obviously, lim^/^oo B\{K') < B\ because of the suboptimality of the choice t\ = 1/2. Thus, 
Pi(K,S) < Pr(S* < Bi). Moreover, when Z 13 < M H (Z 12 ,Z 2 3), 



Z23 \Zi3 Z\2 J U Z\2 \Z\3 Z23 



1 Z 12 



2 V Zia 



1 + 



1 Z 2 



2 V ^13 



V Z12Z23 



< 



< 



9 / ^12 + ^2 
2Z 13 



1 



\fZv2~Z: 



23 



2 / 1 1 

^13 V -Zl2 -^23 ' 



where the second line is due to the concavity of the square-root function. Hence 



Px(K,S) < Pr(S<-L 

^13 



1 1 1 \ (I 1 1 

1 > ) • Pr I 1 > 

Z\2 Z23 Z\3j \Z\2 Z23 Z\3 



Z\3 V Z12 Z23 ) \Z\2 Z23 Z\3j 



(± 



1 



By Claim [Q 



~ ( 1 1 1\ 

Pr + > - e~ z dz 

n -^23 2/ 

i_ 

3 [l - 2zK 1 (2z)e~ 2z } e~ z dz 
1 . -e3 - / S 2zK 1 (2z) e - 3z dz, 



and 



^2 

s 



Pr 



S 2 z ^ 1 1 , 1' 

2 Z12 ^23 ^ . 



c<2, 

O A 



z dz 



e 



e z dz. 
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Now by repeated uses of L'Hospital's rule, we have 



lim 



S^oo 1/S 2 



lim 



l-e u - jQ2zK l (2z)e- 3z dz 



lim e 

u->0 



_ u l-2uK 1 (2u)e 
2u 



-2u 



lim 2u[K 1 (2u) + K (2u)]e 

1, 



-2 a 



where the third equality is due to the fact that the derivative of — uK\{u)e~ u is u[Ki(u)+Ko(u)]e ~ u 
[26], and the last equality is due to the facts that lim u ^o uK\ (u) = 1 and lim u ^Q uKq(u) = 
[24]. To find the asymptotic order of b, let us write 



dz. 



First, we note that 



e s c < o < c. 



Then again by repeated applications of L'Hospital's rule, we have 



lim 



S-oo log(S)/S 2 



lim 



8 lim 



J Q VZU 2zK 1 (2z)e- 2z -^K 1 (^) 



e -«t 



4 lim 2V2u 

u-+0 



— ^-u log u 

J2V^[^l(^) + ^0( a; )] e " X ^ 

— log u — 1 
#i(2V2«) +ifo(2>/2«) 



-2v / 2?I 



where the second equality is obtained by a change of integration variable after the use of L'Hospital's 



rule. As a consequence, lims_ 



•°° log(5)/S 2 



4) By Corollary EU B X (K) > \im. K , ^qBx(K') = mini 
addition, the bound is achieved as K approaches zero. Thus 



1 1 1 

ZlZ ' ^23 Z\2 



for all if > 0. In 



fMK-S) > Pr(S<min(-L + 

L Z13 Z23 Z12 

P. [S<^-)-Px (s<J_ + J- 

^13/ V Z 23 z 12 



1 - e" 
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where the last line is due to Claim Q] and the bound is achieved as K — > by monotone convergence. 
Moreover, 



lim 

S^oo 



1 — e s 



1/S 2 



1 - 2uifi(2u)e- 2u ,. l-e~ w 
lim • lim 

u^Q U u— >0 U 

2 • 1. 



5) By Corollary 13.21 and similar to part 3), we have 



P 2 (K,S) < Pr(S<-^ 

^13 



1 1 1 \ / 1 1 1 

I > ■ Pr 1 > 

Zl2 Z13 + Z23 Z13 J \Zi2 Z13 + Z23 Z13 



+ Pr 5 < 



Z13 V Zi 2 Z13 + Z23J \Z\2 Z13 + Z23 Z\3 J 



+ 



•Pr 



( 1 1 
' + — < 



By Claim [Q 



/ S Pr(-L + -i— >-) e~ z dz 

JO \Zl2 Z 23 + Z z) 

2zK 1 (2z)e- 2z dz, 



1 — es 



and 



V 2 



S 2 z 1 



1 



Pr I — < + 

o V 2 Z 12 Z 23 + z 

2z 



< - I e z dz 

z 







2zK x {2z)e 



dz. 



Now by repeated uses of L'Hospital's rule, we have 

1 - e u - /„" 2zKx(2z)e- 2z dz 



lim 



5^oo 1/S 2 



lim 



lim 
1/2. 



e~ u - 2uKi(2u)e 
2u 



-2u 



As derived in part 3), lim 



5-00 log(S)/S 2 
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6) By Corollary 13.21 and similar to part 4), we have 



P 2 (K,S) > Pr LS<min 



1 



1 



+ 



1 



s { 1 1 

Pr [ S < — + — — 

Zj\2 ^23 + z 



Z±3 ' ^13 + Z 23 Z\ 2 

e~ z dz 



1 — e s 





(2 




\s 



e s 



e s . 



dz 



where the equality in the third line is established by Claim Q] and the bound is achieved as K —> 
by monotone convergence. Moreover, 

2 *i (I) 



1 — e s — -o 
lim — 

s-tL i/s 2 



e s 



lim 



1- e~ u -2v?K 1 (2u)e 



-2u 



lim 6 - 2 "^( 2 ") e 2 " + r im 2u[K x {2u) + K (2^)]e- 2u 

M^O 2lt M— *0 

, -e" n - 4«[^fi(2«) + J Fsr (2u)]e- 2u 

lim h 1 

2 

1/2 + 1 = 3/2. 



G. Proof of Theorem \4.2\ 

To prove part 1) of the theorem, we employ the Fano inequality as in [15]. For the remaining parts, 
the achievability proofs are based on extending the Feinstein lemma [29], [27], [28] to the various cases 
of interest. 

1) Suppose that K is (e, P t )-achievable. Hence, for any < 7 < K, there is a sequence of 
(n,M n ,e n ,P n ) -codes satisfying e n < e + 7, ±logM n > K - 7, and P n < P t + 7 a.s. for 
all sufficiently large n. Let M be the uniform random variable representing the message being set 
from the source to destination. Since M is independent of Z, conditioned on the link realization 
Z = (Z13, Z12, Z23), by the Fano inequality, 

Pr(error|Z= (Z 13 , Z 12 , Z 23 )) > 1 - _L_[/(M; Y n \Z = (Z 13 , Z 12 , Z 23 )) + 1] 

logM n 

> 1 - -7^ -MM; Y n \Z = (Z13, Z 12j Z 23 )) + 1], (17) 

n(A — 7) 

for all sufficiently large n. Since the relay channel is memoryless conditioned on Z = 
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(Z13, Z12, Z 23 ), by the same argument as in the proof of Theorem 4 in [3] 

I(M;Y n \Z = (Z 13 ,Z 12 ,Z 23 )) 

< n-mm{I(X 1 ,X 2 ;Y\Z = (Z 13 ,Z 12 ,Z 23 )),I(X 1 ;Y,Y 1 \Z = (Z 13 , Z 12 , Z 23 ))} 

< n- Ri h (Z 13 ,Z 12 ,Z 23 ;P t + 7) 

where 



(18) 



R\h(Zi 3 , Z 12 , Z 23 \ P t + 7) 

' Pt + 1 
N W 



max min <; C 

0<o,/3<l 



aZ 13 + (1 - a)Z 23 + 2^(1 - P)a{l - a)Z 13 Z 



2.3 



The two conditional mutual information terms on the right hand side of the first inequality in (fT8l) 
are based on the element-wise conditional pdf Py,y 1 \X 1 ,X 2 ,z(Vi yi\xi,x 2 ) and conditional input pdf 
PXiyXa\z( x ii x 2) based on the code as in [3] and the second inequality is due to the fact these mutual 
information terms are maximized by independent Gaussian inputs X\ and X 2 [5] and P n < P t + 7 
for sufficiently large n. 

Now putting ( fT8l ) into ( fTTl ) and noting that Pr(error|Z) > 0, we have 

Rlb(Zi3,Zi2,Z 23 ;Pt + j) 1 



max < 1 



e + 7 > e n = E[Pi(error\Z)] > E 



for sufficiently all large n. Since 7 is arbitrary, 



n{K — 7) 



,0 



e > E 



, , R\b(Zi 3 , Z i2 , Z 23 ; P t ) 
max < 1 — , 



>5Pr(i? lb (Z 13 ,Z 12 ,Z 2 3;Pt) < (l-S)K) 



for all < 5 < 1. From Theorem 12.11 we have then e > 5Pib ^if, e(1 e ^ ) 7 < 1 _ 1 S^j . 
2) We employ the approach of block encoding and parallel Gaussian channel decoding suggested in 
[30]. First we divide a time slot into h = yfn sub-slot^j. We are to send n — 1 messages, each 
coming from one of Ma possibilities, in the whole time slot. Thus in each sub-slot, we have 
n = V2W symbols. Let J\f(fi,a 2 ) be the Gaussian distribution with mean \i and variance a 2 . 
Consider the following code construction. 

a) Codebook generation: Independently generate Ma n-element vectors m,u 2 ,... ,UMf, with all 
elements in the vectors distributed according to i.i.d. A/"(0, 1) Similarly, independently generate 
Mn n-element vectors v\, v 2 , . . . , v Mr with all elements in the vectors distributed according to 
i.i.d. Af(0,l). 



4 For notational simplicity, we assume y/n is an integer with no loss of generality. 
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b) Encoding: Let the kth message be Wk, for k = 1,2,... , n — 1, and wq = 1, which is known 
to the relay and destination beforehand. Let < < 1. In the kth sub-slot, the source sends 
\//3Piv Wk + \/ {1 — 0)P\u Wk _ l and the relay sends \fPiuu, k _ 1 , where Wk-i is the estimate of 
Wk-i that the relay obtains based on the signal that it receives in the (k — l)th sub-slot. In above, 
P l = a P t /2W and P 2 = (1 - a)P t /2W, where < a < 1.. 

When the code defined above is employed, we can rewrite the relationship between the output and 
input symbols described by (TTOb as 



Y 



h,k 



Y 



h.k 



y/0Z 13 P 1 X? k + [V(l " W^P\ + 



^n,fe + N n,k 



(19) 



where the superscript k denotes the kth sub-slot. Hence the relay channel can be alternatively 
specified by the conditional pdf p yfl , k Y "- k \x"- k x"- k a p z(y n ' fc > Ui ' fc |5i' fc , x^' k ) with (X™' k , X^ ,k ). 
Note that since the channel state information is available at the source and relay, a and are 
functions of Z in general. This corresponds to the application of flow control. 
c) Decoding: Consider decoding of the message Wk (k = 1, 2, . . . , h — 1) under the assumption 
that the previous message Wk-i has been correctly decoded, and hence is known, at both the relay 
and destination. Fix any 7 > 0, define the sets 



T?' k (a,0,Z) = (xl k -\x^ k -\x^ k ^ k ,y^ k -\y^ k 



n,k-l „,h,k\z,n,k-l ~n,k-l ~n,k 



-log ■ — — 

n P Y *,*-\Y**\X^-\a,l3,z(y n ' k ~ 1 >y n ' k \ i 2' ) 

> 4 log Mn + 7 I 

n 

r h,ki ~n,k ~h,k\ 

-,n,k,_ a ryx J (~n,k ~n,k fi,k\ 1, PY*' h \X?*,X*> k ,a,l3,Z\yi \ X 1 > X 2 i 1 



l V ' n PYr k \X^ k ,a^z(yi \ X 2 ) U 

In the A;th sub-slot, the relay outputs wj. = i if and only if there is a unique i (from 1 to Ma) such 
that (vi,u Wk __ 1 ,y™ ,k ) € T^' fc (a, /3, Z). This allows the encoding steps mentioned above to continue 
in the (k + l)th sub-slot. In the (k + l)th sub-slot, the destination outputs = i if and only if 
there a unique i such that (vi,u Wk _ 1 ,Ui,y n < k ,y n < k+1 ) £ T 1 n ' fc+1 (a, /3, Z). 
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d) Error analysis: Let e n be the average^ error probability of decoding the whole time slot and 
F k be the event of erroneous decoding in the Mi sub-slot, for k = 1, 2, . . . , h. Then 



})■ 



£n = Pr(uf =1 F fe ) =Pr (uf =1 {F fc n 

Consider the event H f H^T-^i^l . The message Wk~2 is correctly decoded, and hence is known, 
at the relay and destination, while the message Wk~i is corrected decoded, and hence is known, at 
the relay. By symmetry of the code generated, we can assume Wk-2 = Wk-i = Wk = 1 with no 



i 

i,k 



(vi,ui,Ui,y 



loss of generality. For i = 1,2,..., M„, write E, 
and Ef >k = {( Vi , Ul ,y^ k ) £ T™' k (a, /3, Z)}. Then 

^ n (nf-^f ) = (4,) c u (4 fc ) c u (u^S 

Now for i = 2, 3, . . . , M ft , 



h,k— 1 ..n.fc 



< 



< 



T^"{a,l3,Z) 



e~ n ~> 

e -n7 



Par? ' fc_1 ,xj' fc_1 ,x 2 "' fc ,y"' fc - 1 ,y fi ' fc |a,/3,z(' t;i ' u i> y"' fc 1 > y n ' fc ) 



Py«,i=-i ! y»,fe|X™' k - 1 l Q !)( g ! Z 



(/' fc - 1 ,/- fc |m) 



T?' k (a,li,Z) 



where the independence between y n ' fc and u, in the second line is due to the fact that Y n,k and 
m are jointly Gaussian and uncorrected, and the inequality on the third line follows from the 
definition of T™' k (a, (3, Z). Similarly, we can employ the definition of T^' fc (a, (3, Z) to show that 



Pr:(E? k ) < ^ for i = 2, 3, . . . , M fl . Hence 

e n < Pr(uf =1 [(4j c U(^) c ])+^^Pr(^ fc ) + ^^Pr(^ 



h Ma 



h Ma 



k=l 8=2 

< Pr (Uf =1 [( J B 1 1 jfe ) c U (4 fc ) c ] ) + 2ne"^. 



fc=l i=2 



Putting the codes in the h sub-slots together, we obtain a sequence of (n, M? ,e n ,P n )- 
codes over the time slot with e n and P n respectively satisfying lim sup^^^ e n < 



lim sup, 



..,-oo P r ( U fc=l 



(El k ) c U (El k f 



and limsup^^oo P n = Pt a.s. 



The error probability defined here is averaged over all Gaussian codes constructed as described. Thus there is a Gaussian 
code that gives at least the same error performance. 



38 



Further note that as n (and hence n) becomes large 



I n,k—l h,k\~f l ik— 1 ~h,k— 1 ~h,k\ 
1 Pya,k-i Y"- k \X?- k - 1 ,X™- k - 1 ,X™- k ,a,f3,Z\y ' 'V ' \ X 1 > X 2 ' X 2 > 
— log — ; ; ^-r— j 

yZ 13 + (1 - a)Z 23 + 2V(1 - /?)<*(! - a)Z 13 Z 23 



a.s. / / / 



NqW 

h,k\ ~n,k ~h,k\ 



/ n,Ki ~n,K ~n,K\ 
I PY?- k \X?- k ,XZ- k ,a,f3,Z\yi \ X 1 > X 2 ) a.s. ( Ctf3Zi 3 P t 



for all A; = 1, 2, . . . , n, when the inputs symbols are Gaussian distributed as described in the code 
generation step above. 

Now set Ma = e~ and choose a and (3 so that the maximum rate Rdf in Appendix |A] is 
achieved. Since the choice of 7 > is arbitrary, the code construction argument above shows the ex- 
istence of a sequence of (n, M n , e n , P n )-codes over the time slot satisfying liminfn^oo ^ log M n = 
K, limsup^oo e n < limsup^oo Pr (^Rbf < ^fK\ = Pr (i?DF < K), and limsup^^ P n = 
P t a.s. . From Theorem O Pr (R DF < K) = P DF {K, S). Therefore if P DF {K, S) < e, then the 
rate K is (e, P t )-achievable. 
3) We construct a code that conforms to HDP1. Fix < t\ < 1 and t 2 = 1 — t\. Write n\ = \t\n\ 
and n 2 = n — n\. 

a) Codebook generation: Independently generate ni-element vectors u\,u 2 , . . . ,um\ with 
all elements in the vectors distributed according to i.i.d. N(0, 1). Independently generate M% 
ni-element vectors s\,s 2 , . . . ,s\ I2 with all elements in the vectors distributed according to i.i.d. 
AA(0, 1). Similarly, independently generate M% ?i2-element vectors vi,v 2 , . . . , vmi with all elements 
in the vectors distributed according to i.i.d. AA(0, 1). Independently generate M% ^-element vectors 
s\ ,s|> • • • j s \p with a H elements in the vectors distributed according to i.i.d. AA(0, 1). 

b) Encoding: Let M n = x M% x M%. A message w, with value from 1 to M n , can be indexed 
by the triple (i, j, k), where i ranges from 1 to M\, j ranges from 1 to M^, and k ranges from 1 
to M^. Also we divide a time slot with n symbols into two sub-slots: the first with n\ symbols 
and the second with n 2 symbols. In the first sub-slot, the source sends J ^^Ui + y ^ 1 2 ^ Pt ~s)> 
where < a < 1. The relay generates an estimate, j, of j. In the second sub-slot, the source sends 
\J fpF^fc an d tn e relay sends \J ^^y^ - s^., where < j3 < 1. Like before, a and (3 are the power 
control functions depending on the link gain vector Z. 

As in part 2) above, when this code is used, the input-output relationship of the channel can be 
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described by 



Y ni = 



r™ 1 = 



aZ 13 P t ^ 



2W 



X™ 1 + 



aZ 12 P t vni 



2W 



X™ 1 + 



'(1 


- a)Z 13 P t 


2W 


'(1 


- a)Z 12 P t 



I" 1 + AT" 1 



2W 



X™ 1 + N{ 



n-i 



during the first sub-slot with X™ 1 corresponding to the codeword Ui, X 2 1 corresponding to the 



codeword sj, and X™ 
sub-slot, we have 



f^X™ 1 + J (1 2w Pt ^2 1 as 1116 in P ut to the CB channel. In the second 



yr, 



f3Zi 3 P t 



2W 



(l-P)Z 23 P t 



2W 



X n 2 + N n 2 



with X x 2 and X 2 2 corresponding to Vk and si, respectively. 



c) Decoding: We combine the decoding approaches suggested for the CB and MA channels in 
[31] and [27], respectively. Fix any 7 > 0. Define the sets 



T?(a,(3,Z) 
T?(a,(3,Z) 
T^(a,(3,Z) 

T£(a,(3,Z) 
T£(a,(3,Z) 



x"Sy ni : -log - > -logM^+7 

n PY^\a,(3,z{y ni ) n 



■log 



n 



-log 

n 



(x^ 2 ,y" 2 ):-lo f : 
n 



n 
1 



log 



(x^ 2 ,y" 2 ):-log 





■ L \ 




Pynx ix^i, 1 




) 


fV" 1 IX" 1 ,X 2 " X ,a,/3,z(yi 1 


5 1 1 






) 




|X" 2 ,X™ 2 ,a,l3,z(y n2 




^2 2 ) 


PY"2\xZ 2 ,a,l3,z(y n2 


l x 2 


) 


PY^\xp,X^, a ,/3,z( yn2 


~™2 




PY^xp^,z(y n2 


\xT 


) 


PY^\X"\xp, a ,/3,z( yn2 


~™2 




PY"2\a,f3,z(y n2 


) 





> -logM^ + 7 

n 



> - log M*M* + 7 



> -logM^ + 7 

n 



> -logM^ + 7 

n 



> - log MlM^ + 7 



In the first sub-slot, the relay sets j = j if and only if there is a unique pair (i, j) such that 
(uj^y" 1 ) G Tg(a,f3, Z) C\T$(a, f3, Z). This allows the encoding step in the second sub-slot 
mentioned above. The destination outputs i if and only if there is a unique i such that (ui,y ni ) G 
T"(a, (3, Z). In the second sub-slot, the destination outputs (j, /c) if there is a unique pair (j, k) such 

that (sj, v k , y™ 2 ) G T 4 "(a, /3, Z) n T 5 n (a, /?, Z) n T 6 n (a, /3, Z). Finally, the estimate of the message 
is then w = k). 

d) Error analysis: Because of the symmetry of the code, we can assume w = (1, 1, 1). 
Let e n be the average error probability of decoding. For i = 1,2, ... , M*, write E\ = 

{(u t ,y^)eT^(a,(3,Z)} and for i = 1,2,..., and j = 1,2,..., Ml E% = 
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Uu h s),y^) G T 2 "(a, 0, Z) n T 3 "(a, /3, Z)}. For J = 1, 2, ... , and k = 1,2,..., M*, E% = 
[(v k , s],y n *) e T£(a, (3, Z) n T£(a, (3, Z) n T«(a, /5, Z)}. Then 

e n <Fr{(E 1 i yu(E 2 li yu(Et i y)+Y J ^(E})+ £ Pr(4) + £ Pr(^). (20) 

*=2 (i,fc)^(i,i) 

Using the definitions of Tf(a,/3, Z) to T 6 n (a,/3,Z) and similar to part 2) (see [31], [27] for the 
detailed arguments), one can show that the second, third, and fourth terms on the right hand side 
of (|20l can be bounded by e - ™ 7 , 2e _n7 , and 3e _n7 , respectively. 
As n becomes large, 



-log 

n PY"i\a,/3,Z 



— log 

11 

— log 

n 

1 



log 



n 

— log 

n 

— log 

n 







K 1 


) 




Py* 


^\x^,x^, a ,p,z( y ^ 


\~ni 




a.s. 




PY^la^ziyT 


) 








'2\X^,X^,a,l3,z(y n2 


~™2 
X l ■ 


x n 2 2 ) 


a.s. 




PY^\XZ 2 ,a,l3,z(y U2 


l x 2 


) 






'■2\xp,XZ 2 ,a,f3,z(y n2 


~™2 
X l ! 


5?) 


a.s. 




PY™2\X? 2 ,a,f3,z(y n2 


|s? 


) 




P Y r 


'■2\X" 2 ,Xz 2 ,a,l3,z(.y n2 


~™2 


5?) 


a.s. 



PY"2\ a ,/3,z( 



t x c 
he 
hC 
t 2 c 
t 2 c 
t 2 c 



aZ X3 P t 



(1 - a)Z l3 P t + N W 
(1 - a)Z 12 P t 

Z\ 2 Pt 



N W 

pz 13 p t 

N W 

(1 - (3)Z 23 P t 
N W 



pZ 13 P t + {\- (3)Z 23 Pj 
N W 



when the inputs symbols are Gaussian distributed as described in the code generation step above. 
For the cases of t\ = and t\ = 1, the channel reduces to the case of MA and CB, respectively. 
Hence the corresponding subset of code construction should be employed. 

Now let M\ = e nx \ M% = e nX2 , and M% = e nX3 such that x 1 ,x 2 ,x 3 >0 and x\ + x 2 + x 3 = K. 
Choose t\, a, f3, x\, x 2 , and x 3 as functions of Z to minimize P t . Since 7 > is arbitrary, by 
Theorem 13.11 the code construction above shows the existence of a sequence of (n, M n , e n , P n )- 
codes over the time slot with satisfying liminf^oo ^logM n = K, lim sup^^ e n < P\(K,S), 
and limsup n _ +oc P n = Pt a.s. 

Indeed, to see that Theorem 13.11 applies here, it suffices to show that the optimization solution 
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described in Section IIII-A.ll lies within the following region 




> X\ 



> X 2 



> X\ + X 2 



> X3 



> X 2 



> X2+X3. 



The last three inequality coincide with the MA region specified in part 2) of Lemma 13.11 (see 
Appendix |B]). For Z13 < Z\2, it is easy to see the third inequality is redundant in place of the first 
two inequalities, which coincide with the CB region in part 1) of Lemma |3~T1 For Z13 > Z\2, the 
optimal solution specified in Appendix [C] can be achieved by the choice of t\ = 0, hence making 
only the last three inequalities matter. 
4) All the arguments are essentially the same as in part 3) with the modification that the source sends 



power control component, in the second sub-slot. 

H. Proof of Theorem \4.3\ 

We sketch the proof of the theorem, which employs results from [16] directly. Below we use the index 
i to denote one of the four cases of lower bound (i = lb), decode forward (i = DF), HDP1 (i = 1), 
and HDP2 (i = 2). 

For i G {DF, 1,2}, replacing P t by S(Z)[e K - 1]N W in the proofs parts 2) - 4) of Theorem 
given in Appendix [Gl we can show the existence of a sequence of (n, e nK , e n , P n ) -codes with P n < 
S(Z)[e K — 1]NqW + 7 a.s. and e n < Pi(K, S(Z)) + 7, for any 7 > 0, whenever n is sufficiently large. 
Note that we write the RNSNR S(Z) to highlight the use of a general power control scheme which 
varies the total transmit energy (rather than setting it to a fixed value as in the original proofs) in each 
time slot according to the link gains. 

Consider the optimal power control function Si{Z) that solves the following optimization problem: 




and the relay sends 




si, where < 5 < 1 is an additional 



mm s(z) Pi(K,S(Z)) 
subject to E[S(Z)] < S 



N W(e K -l)- 



Pt 



(21) 
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Write B\ h , B DF , Bi(K), and B 2 (K) as Bi(Z) for i = lb,DF,l, and 2, respectively to highlight their 



link gain vector Z. Let s* = sup{s : Si(s) < S}. Then Proposition 4 of [16] can be applied to solve the 
optimization problem in (1211 ). provided that Bi(Z) is continuous for all Z ^ and is non-increasing in 
each of the elements of Z (c.f. Lemma 2 of [16]). This latter condition is established in Corollaries 13.11 
and 13 .21 for i = 1 and 2, respectively. For i = lb and DF, the condition can be checked by straightforward 
calculus. The resulting solution is 



and Pi(K, Si(Z)) = Pr( J B i (Z) > s*). Now, if E[Bi{Z)} = lim^oo S;(s) is finite, setting S = E[Bi(Z)] 
will make s* = oo and hence P^K, Si (Z)) = as well as E[Si(Z)} = E[Bi(Z)}. 

For the cases of i = DF, 1, and 2, this implies that the rate K is long-term E[BDF(Z)](e K — l)^^- 
achievable, long-term E[B 1 (Z)](e K - 1)N 1-F-achievable with HDP1, and long-term E[B 2 (Z)](e K - 
l)A^oW / -achievable with HDP2, respectively. For i = lb, a slight modification to the proof of part 
1) of Theorem 14.21 shows that if the rate K is long-term S(e K — l)iVoVF-achievable, then the error 
probabilities of the sequence of codes (and the corresponding power control schemes S n {Z)) must satisfy 



E[S n (Z)] < 5+7 and 5 Pa, (K, e(1 e _^ )if 1 _ 1 5 w (Z) j < e n < 7 for all < 5 < 1 and any 7 > 0, whenever n 



is sufficiently large. Since is continuous in the second argument, this requires that P\b(K, S n {Z)) = 
for all sufficiently large n. The solution of (|2TI ) then implies that S > E[Br,(Z)]. 

Since B ih (Z) < B DF (Z) < B 2 (Z) < B 1 (Z) for all Z, it suffices to establish the finiteness of 
E[B\(Z)\. From the proof of part 3) of Theorem gH 



dependence on Z. Define Si{s) = f. 



{Z:Bi(Z)<s} 



Bi(Z)dFz, where Fz is the distribution function of the 





v ^13 




if Z13 >M H {Z 12 ,Z 2 3). 



Thus it in turn suffices to establish the finiteness of E[B\{Z)\. Indeed 





a 




(22) 



b 
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where the second equality is obtained by using the integral representations of Kq(x) and K\(x) in [25, 
pp. 969]. Again using the property of the modified Bessel functions, it is easy to check that the integrand 
in the integral a on the right hand side of (1221 is bounded above over the range of < x < 1 and is 
bounded above by e~ x for x > 1. Thus a is finite. On the other hand, we have 




where the first integral on the right hand side is v2T (~) (see [25, pp. 942]) and the second integral is 
finite (see [25, pp. 733]). Thus b is also finite. 
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Fig. 1. The classical 3-node relay channel. 



TABLE I 

Decibel losses in performance with respective to an AWGN channel. 





E[B DF ] d B 


£[£i(0)] dB 


^[ J B 1 (oo)] dB 


E[B 2 (0)] dB 


E[B 2 (oo)] dB 


2.17 


2.76 


3.33 


5.45 


3.02 


5.36 
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Fig. 2. Outage probability bounds of Theorem 14.11 



47 




Fig. 3. Outage probabilities for HDP1 obtained from Monte Carlo calculations. 
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4. Outage probabilities for HDP2 obtained from Monte Carlo calculations. 
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5. Plot of delayed-limited rates for various transmission schemes. 



